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and Jeremy H. Lakey* 



Regenerative medicine demands the recreation of complex cell- 
cell and cell-matrix interactions observed in vivo^ and has led to 
the development of artificial biomaterials to mimic the protein 
network in the extracellular matrix (ECM)J 2 ' 3 ^ Future develop- 
ments would benefit from an economic supply of protein poly- 
mers which closely match the molecular structure of the natural 
material. The polymeric Cafl protein, from the plague bacterium 
Yersinia pestis, forms an enveloping hydrogel whose role is to 
inhibit interactions with host cells. ^ Furthermore, it shares a 3D 
structure with the largest class of human extracellular proteins. 
Here we show that recombinant Cafl is a highly adaptable scaf- 
fold for a synthetic biology route to protein polymer engineering. 
It is robust, resisting both thermal and proteolytic degradation 
and when purified it retains its biological ability to prevent mam- 
malian cell attachment; three advantageous properties difficult to 
design into a protein de novo. We then reverse the "non-stick" 
phenotype by inserting a cell adhesion motif, express mixed poly- 
mers of different subunits and form hydrogels using a simple 
cross-linker. Such animal free proteins, economically produced 
in E. coli, offer a new family of tissue culture materials. 

Gram-negative bacteria often form protein polymers on their 
surfaces via the chaperone-usher (CU) pathway,^ so named 
because monomeric subunits, secreted into the bacterial peri- 
plasm, are initially stabilized by a specific chaperone proteinJ 6 ] 
Subsequently this binary complex interacts with an outer 
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membrane "usher" protein^ that provides a channel through 
to the extra-cellular surface. The nascent polymer grows out 
through this usher and across the outer membrane by the 
addition of monomeric subunits to its periplasmic end. Each 
monomer donates a single beta-strand to the preceding mon- 
omer thus joining the subunits by a strong but non-covalent 
link. Many CU proteins of pathogenic bacteria are so called 
adhesins, involved in binding to host cells, ^ but the bacterium 
Yersinia pestis, the etiologic agent of the bubonic plague, pro- 
duces a unique form of CU polymer. After being injected into 
a warm blooded host via a flea bite, [9] a temperature sensitive 
promoter expresses the cafl gene to produce a thick polymeric 
CU hydrogel coating on the surface of the cell. This efficiently 
resists cell interactions and, by acting as a kind of anti-adhesin, 
inhibits macrophage attacks.^ 

The formation of the Y. pestis Cafl protein has been 
described by a series of elegant papers from Knight and co- 
workers who solved the high resolution structures of the 
chaperone (CaflM-Cafl) and also revealed the energetics 
of the polymer formation J 10,1 ^ We recently determined, by 
electron microscopy, the structure of Cafl and revealed for 
the first time the conformational flexibility and large size of 
these polymers J 12 ^ They were up to 2 um in length consisting 
of 800 monomers with molecular weights of up to 11 MDa. 
Usefully, these show no aggregation (apart from some inevi- 
table knots) and remain soluble at >50 mg mlr 1 . Furthermore 
the polymers are stable up to 90 °Cj 13 l protease resistant and 
easily purified. [14 J 

Models of Cafl structure^ 12 ' 15 ] and the structure of the similar 
Saf protein^ 16 ] predict it to be a polymer of immunoglobulin 
like domains (Figure 1A and B). As such it resembles the pre- 
dominant family of extracellular proteins in humans (-3% of 
human protein-coding genes) which includes fibronectin type 
III repeats. [17,18] Since Cafl also displays highly desirable prop- 
erties (non-adhesion, stability and ease of production) that are 
difficult to design de novo into protein polymers, we investi- 
gated whether it could be a useful animal-free ECM substitute. 

To imitate fibronectin, we inserted into Cafl the Arg-Gly- 
Asp-Ser peptide (RGDS) which has been shown to confer cell 
adhesion properties similar to the whole fibronectin mole- 
cule. [19] The RGDS motif was incorporated into surface loops in 
order to expose the motif as in fibronectin, ^ limit the changes 
to the structure of Cafl and also to avoid important sites of 
Cafl: chaperone/ usher interactions J 10 l Inserts were modeled 
using PyMOli 20 ] and the published coordinates for Cafl (PDB 
file: 1Z9S) [10 1 and finally five mutants were expressed and puri- 
fied (Figure 1A). 
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Figure 1. Expression of engineered Cafl . A) Upper molecule in cyan; Fibronectin Type III domain (PDB File 1 FNFt 17 ! with sites of known cell adhesion 
motifs (RGDS and the accessory site PHSRN) highlighted in magenta. Lower molecule in green; Cafl model based upon X-ray and EM structu rest 1 0,1 2 1 
with RGDS insertion sites (loops) numbered. B) Linear, bead on a string, structure of Cafl revealed by negative stain transmission electron microscopy. 
Box is 400 nm across. C) Expression of Cafl showing pelleted cells and Cafl rich flocculent layer. SDS-PAGE Coomassie blue stained gel with heat 
treated flocculent layer sample (left) and M, molecular weight markers (molecular mass x 10 3 kDa arrowed) (right). WB-Western blot using anti-Cafl 
antibodies. Pelleted cells showing lack of flocculent layer in empty vector and Loop2 RGDS samples whilst a clear flocculent layer is present in the 
Loop 5 mutant tube. D) Analysis of purified Cafl polymers by western blotting using a mouse monoclonal anti-Cafl antibody. WT (lanes 1-3), Loop5 
RGDS (lanes 4-6) and Loop5 RGES (lanes 7-9). Lanes 1, 4, 7 heated at 95 °C for 5 min showing only monomeric Cafl. Lanes 2, 5, 8 heated at 95 °C 
for 45 s showing a ladder of Cafl multimers. Lanes 3, 6, 9 unheated showing only high MW polymers. E) Far UV-CD spectrum of WT, Loop5 RGDS and 
Loop5 RGES Cafl polymers. Each curve represents the average of 10 accumulated spectra measured at a concentration of 0.5 mg mL" 1 Cafl (0.05 cm 
path length cell). The sample contained 50 mM sodium phosphate, pH 7.2. Each spectrum was corrected by subtraction of a comparable blank. The 
abscissa is in units Ae (M _1 cm" 1 ) where M is the molar concentration of amino acid residues. 



The expression of cafl, from its own temperature dependent 
promoter, was revealed by the presence of a flocculent layer (FL) 
above the cell pellet (CP) after centrifugationt 14 ! (Figure 1C). 
We analyzed this polymer rich layer by SDS-PAGE using three 
methods of preparation, no heat denaturation in which the 
polymer is intact (Figure ID lanes 3, 6, 9), limited heat denatur- 
ation (45 s at 95 °C) in which a ladder of oligomers is formed 
(Figure ID lanes 2, 5, 8) and full heat denaturation (5 min at 
95 °C) in which mostly monomers are observed^ 12 ! (Figure ID 
lanes 1,4,7). In the heat denatured sample a significant band ~ 
15 kDa was confirmed as Cafl monomer by western blot and 
peptide mass finger printing (Supporting Information, Figure 
SI and 2). 



Of the five mutants, L2RGDS (Loop 2 insertion) did not 
express Cafl, L5RGDS had the highest yield and thus we 
constructed a loop 5 Cafl RGES mutant (L5RGES) as a non- 
cell-adhesive control^ and a high yield of L5RGES was also 
obtained (Figure 1C). Both formed WT-like polymers of high 
molecular weight (Figure ID) and the thermal transition tem- 
perature of unfolding (Tm), obtained by Far-UV CD (Figure IE) 
and DSC, were little changed Cafl WT (DSC = 86 °C, CD = 
83 °C), Cafl L5RGDS (84/83 °C), and Cafl L5RGES (83/81 °C). 
The far-UV CD spectra of Cafl RGD/ES L5 show slight differ- 
ences to that of CaflWT. This is an unusual spectrum, quite 
unlike classic the beta-strand structures, so the reasons for 
the weaker signal at 205 nm is unclear. Thus we checked the 
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structure by near UVCD and electron micros- 
copy. The spectra for near UV CD (Sup- 
porting Information, Figure S3 A) for the Cafl 
WT, Cafl RGDS L5 and RGES L5 again show 
a similar structure. The main structural char- 
acteristics revealed on the near UV CD spec- 
trum previously^ 12-14 ] such as the two minor 
(at 262 and 269 nm) and major peaks at 283 
and 290 nm were observed in this study. 
Analysis of transmission electron microscopy 
images of negatively stained Cafl fibres was 
performed using Jmicrovision softwareJ 22 ^ 
We determined the length of Cafl WT, Cafl 
RGDS L5, Cafl RGES L5 fibres and the mean 
fiber lengths were 401, 323, and 257 nm, 
respectively (Supporting Information, Figure 
S3B). However it was evident, by inspecting 
the box charts, that there was a great het- 
erogeneity of Cafl fiber length in all sam- 
ples (Supporting Information, Figure S3B). 
Similar results for Cafl WT fiber length were 
observed by Soliakov and colleagues. [12] 

Next, to assess Cafl's suitability for cell 
culture, cell viability was assayed via calcein 
uptake. When cultured on surfaces coated 
with Cafl-RGDS, Cafl-RGES and Cafl-WT 
neither 3T3 fibroblasts nor PC 12 cell lines 
revealed any toxicity within 48 h (Supporting 
Information, Figure S4). However, 3T3 cells 
proliferated more on fibronectin. Fibronectin 
is a large protein which contains several 
other important peptides for cell proliferation 
such as PHSRN, LDVP and IDAP and so the 
challenge is to incorporate several different 
motifs in Cafl fibers as shown later. 

We then studied cell morphology and 
adhesion in detail using scanning elec- 
tron microscopy (SEM). Each cell line was 
grown on 12 mm glass cover slips pre-coated 
with either CaflWT, L5RGDS, L5RGES, 
Fibronectin or Collagen IV, incubated for 24 h, 
fixed and visualized by SEM (Figures 2, 3). 
CaflWT inhibits adhesion of both PC12 and 3T3 cells whereas 
this effect is reversed on L5RGDS (Figure 2A and B). To quan- 
tify cell adhesion on Cafl protein-coated glass surfaces and on 
the control surfaces: collagen IV and fibronectin, ten images, of 
each cell line were examined. The total number of cells counted 
varied between 558 for PC12 on collagen IV to 9 for 3T3 on 
WT Cafl and the percentage of cell adhesion was determined 
(Supporting Information, Table S2). This revealed that approxi- 
mately 80% of 3T3 cells bound to the L5RGDS and fibronectin 
surfaces whilst only 3% bound to the WT Cafl surface. How- 
ever, the most striking differences were apparent when the 
shapes of the adherent cells were analyzed according to their 
interaction with the surface i.e., do they present one or more 
filopodia (cytoplasmic projections) (Figures 2 and 3). Thus 
CaflWT provided a surface where the cells were invariably 
round and showed no projections. Critically, this behaviour of 
3T3 and PC 12 was largely reversed by the simple insertion of 
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Figure 2. Fibroblasts on Cafl polymers. Glass slides were incubated with each of the pro- 
teins shown then used to culture mouse 3T3 fibroblasts for 24 h before being finally fixed and 
imaged by scanning electron microscopy. A) WT Cafl polymer. B) Loop 5 RGDS Cafl polymer. 
C) Fibronectin. D) Buffer treated glass - no protein. E) Control Loop 5 RGES polymer. Histo- 
grams show differences in cell morphology. Non-round cells show one or more filopodia. Data 
represent the mean of three experiments ± standard error of the mean (S.E.M). Significance 
was determined by one way ANOVA analysis with Scheffe as a post hoc test was conducted. 
(*) P<0.01 compared to fibronectin, (**) P< 0.001 compared to fibronectin. 



the RGDS motif, such that the results were identical to those 
on fibronectin (Figures 2 and 3 and Supporting Information, 
Figure S5). Whilst the results for both 3T3 and PC 12 cells on 
L5RGDS mimic those of fibronectin, PC12 cells, as expected, 
grow rather better on collagen (Figure 3). 

Previous work on other CU proteins has shown that when 
two subunit genes are expressed in the same cell they com- 
bine in the same polymerJ 23 ^ This offers a synthetic biology 
approach whereby a series of different monomers could be 
expressed in a single cell under the control of different pro- 
moters to create a range of mixed polymers on demand. To 
demonstrate this possibility for the Cafl polymer system we 
expressed two different genes with two plasmids each carrying 
a different origin of replication, selection marker and promoter 
(Figure 4A); pAH34L containing the complete wild type Cafl 
operon [14] and pBAD33 [24] which expressed only a mutant cafl 
gene which included a FLAG epitope termed cafl-FLAG. The 
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Figure 3. PCI 2 cells on Cafl polymers. Glass slides were incubated with each of the proteins 
shown, used to culture rat pheochromocytoma PCI 2 cells for 24 h before being finally fixed and 
then imaged by scanning electron microscopy. Top; low magnification images to display differ- 
ences in cell numbers on WT Cafl , Loop 5 RGDS Cafl and collagen IV polymers. Lower images; 
comparison of cell morphology on the different polymers WT Cafl polymer; Loop 5 RGDS Cafl 
polymer; Collagen IV polymer; Loop 5 RGES polymer; Fibronectin (FN) and Buffer treated glass 
- no protein (NP) Histograms show differences in cell morphology. Non-round cells show one 
or more filopodia. Data represent the mean of three experiments ± standard error of the mean 
(S.E.M). Significance was determined by one way ANOVA analysis with Scheffe as a post hoc 
test was conducted. All treatments were P < 0.001 compared to collagen IV. 



expression of cafl encoded by pAH34L is temperature sen- 
sitive whilst the expression levels of cafl-FLAG encoded by 
pBAD33 can be modulated over a varied range of L-arabinose 
concentrations. Analysis of co-expression of Cafl mutants was 
performed by western blot. Since the pBAD33 plasmid carries 
neither chaperone nor usher protein genes the cafl-FLAG did 
not form polymers by itself (Figure 4C lane 6). As earlier shown 
in Figure ID, the western blot in Figure 4B shows that WT 
pAH34L plasmid formed a polymeric protein. No full length 
polymer protein was detected by western blot in the unheated 
sample (Figure 4B lane 7) because transferring such large poly- 
mers to the nitrocellulose is inefficient. However, dimers and 
trimers can be seen in the sample heated for 45s (Figure 4B, 
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lane 8) and only monomer is observed in the 
fully heat denatured sample (Figure 4B lane 
9). Finally, co-expression of the two plasmids 
produced FLAG labeled dimers and trimers 
when heated for 45s (Figure 4C lane 2). Thus, 
cafl-FLAG must have assembled into poly- 
mers using the WT usher protein supplied 
by plasmid pAH34L. Since the corresponding 
lane 2 in Figure 4C also reacts with anti-Cafl 
antibodies it is thus reasonable to assume 
that we have produced mixed polymers com- 
posed of the products of both plasmids and 
not pure polymers of the different subunits. 
(Figure 4B and C lane 2). Thus, Cafl-FLAG 
was detected outside the cell in the floccu- 
lent layer (Figure 4C) showing for the first 
time the export of Cafl hybrid polymers. In 
future the scaffold could more closely mimic 
the complexity of ECM by containing several 
functional monomers including different cell 
adhesion motifs e.g., PHSRN (Figure 1A) or 
protease remodelling sites. 

The Cafl polymer can also be used to form 
a hydrogel with possible applications in 3D 
cell culture. WTCafl at a final concentration 
of 30 mg mL" 1 produces a viscous solution 
which we stabilized by the addition of amine 
reactive cross linkers of various lengths 
DTSSP (12.0 A), NHS-PEG-NHS (197 A), 
and 4-arm NHS-PEG (2 x 197 A). These can 
react with any of the eight surface lysines 
on each Cafl monomer. The Cafl hydrogels 
were characterized using a simple tube- 
inversion assay in which non-cross linked 
solutions would flow down the sides of the 
plastic reaction vessel. The gelation time 
was visually estimated (Supporting Informa- 
tion, Table S3) to be within 24 to 27 min for 
NHS-PEG-NHS and 2 to 22 min for 4-arm 
PEG-NHS, depending on the concentration 
of the cross-linker. The higher the concen- 
tration of these two cross-linkers the quicker 
the gelation time. With DTSSP a solid gel 
was not observed. The increase in gelation 
rate of 4-arm PEG-NHS could be due to the 
structure of 4-arm PEG which influences its 
ability to react with the primary amine groups of Cafl. [25] The 
gelation time reported here is comparable with other studies 
using PEG hydrogels, for example Liu et al., (2012)[ 26 1 who 
examined the combination of PEG diacrylate (PEGDA) and 
acryloyl- P E G - RG D . 

After cross-linking for 30 min, samples were analyzed on a 
4-20% gradient gel. The band of approximately 15 kDa corre- 
sponding to the Cafl monomer in the non-cross linked Cafl 
sample (control) was used as a reference for the subsequent 
analysis. The relative density of the unreacted Cafl monomer 
band, in Cafl samples cross-linked with the different cross- 
linkers at various concentrations, was determined. Since the 
calculations of the relative density for the high molecular weight 
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Figure 4. Complex polymers and hydrogels. A) Schematic of the co-expression of WTCafl with Cafl -FLAG using the plasmids pAH34L (WTCafl , chap- 
erone and usher genes) and pBAD33 (Cafl -FLAG only). WT and mutant monomers expressed from different plasmids create mosaic Cafl polymers 
composed of the two different subunit types. WTCafl represented in red and Cafl -FLAG in green. (B) pBAD33_SD_cafl NT-FLAG + pAH34L probed 
with anti-Cafl antibody. M, molecular weight marker proteins (molecular mass kDa); lane 1 , pBAD33_SD_cafl NT-FLAG + pAH34L sample non-heated; 
lane 2, pBAD33_SD_cafl NT-FLAG + pAH34L sample heated at 95 °C for 45 seconds; lane 3, pBAD33_SD_cafl NT-FLAG + pAH34L sample heated at 
95 °C for 5 minutes; lane 4, pBAD33_SD_cafl NT-FLAG sample non heated; lane 5, pBAD33_SD_cafl NT-FLAG sample heated at 95 °C for 45 seconds; 
lane 6, pBAD33_SD_cafl NT-FLAG sample heated at 95 °C for 5 minutes; lane 7, pAH34L sample non-heated; lane 8, pAH34L sample heated at 95 °C 
for 45 seconds; lane 9, pAH34L sample heated at 95 °C for 5 minutes. (C) Identical samples probed with anti-Flag antibody. Lane 2 shows incorporation 
of Flag-mutant from pBAD33 into Cafl polymers. The monomer bands on Figure 4B and C have run at different levels, largely due to gel differences 
(see markers) but possibly also due to the introduction of the charged FLAG epitope D,E) SEM of freeze dried Cafl hydrogel crosslinked with 4 arm 
PEG. Scale bar (D) = 500 uM and scale bar (E) = 50 uM. (F) ESEM of Cafl hydrogel crosslinked with 4 arm PEG. Scale bar = 50 uM. 



bands were more complex, these were considered a single band 
and referred to as the "Cafl cross-linked fraction". The 4-arm 
PEG-NHS showed the least amount of residual monomer and 
thus a higher degree of cross linking (Supporting Information, 
Table S4) 

Cafl polymers cross-linked with a short arm length cross- 
linker such as DTSSP (12.0 A) promote a closer contact between 
the Cafl fibres. Thus, the images obtained by TEM revealed a 
compact Cafl hydrogel which reflects the Cafl fibres proximity. 
A different result was obtained for Cafl polymers cross-linked 
with a long spacer NHS-PEG-NHS (197 A), which can better sep- 
arate the Cafl fibres. Large Cafl hydrogel meshes were observed 



by TEM. When we used the 4-arm PEG-NHS where each cross 
section is also 197 A the interactions between Cafl and the 4-arm 
PEG-NHS the structure was more condensed than that produced 
by NHS-PEG-NHS (Supporting Information, Figure S6). 

When using a monomeric form of Cafl made by circular 
permutation, cpCafl, [13] these large networks were not seen 
(Supporting Information, Figure S7). A highly porous hydrogel 
(Supporting Information, Figure S7) could be advantageous for 
swelling and water uptake and also as a scaffold for cell culture 
that can allow the passage of nutrients, oxygen through the pores. 
The TEM images confirmed that the formation of the Cafl hydro- 
gels depends on the cross-linker concentration and structure. 
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The pore diameters of 4-arm PEG-NHS stabilized Cafl 
hydrogels were assessed by SEM and ESEM (environmental 
SEM)J 27 1 For SEM we used freeze-dried hydrogels which broke 
into fragments during analysis (Figure 4D) and revealed a 
mean pore diameter of 8 ± 1.9 um (Figure 4E). ESEM analysis 
avoids the dehydration process and these images revealed a 
mesh-like network structure with a mean pore diameter of 300 
+ 0.3 nm (Figure 4F). 

As shown by Lutolf and Hubbell (2005 ) [3] in tight gel net- 
works, cells are required to use proteolytic strategies to degrade 
the surrounding matrix and be able to migrate through the gel. 
Cells can remain viable in stiffer gels, however they cannot 
spread and proliferate which is fundamental for obtaining 
cell-cell contacts and subsequent tissue formationJ 28 ] The Cafl 
polymer thus can be made to imitate different ECM by having 
varied sub-unit composition, different crosslinking ratios and 
possible protease cleavage sites. Its ease of production in an 
inexpensive and safe host bacterium means that it is a viable 
bulk product able to replace expensive cell culture reagents. 



Experimental Section 

For details of the experimental methods used please see the Supporting 
Information. 
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Supporting Information is available from the Wiley Online Library or 
from the author. 
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